Microsoft Unveils VibeVoice-ASR for Long-Form Audio
VibeVoice-ASR offers a unified speech-to-text model for 60-minute audio handling.
Records found: 10
VibeVoice-ASR offers a unified speech-to-text model for 60-minute audio handling.
Introducing FLUX.2 [klein], a cutting-edge family of compact models for interactive visual intelligence on consumer hardware.
Discover the innovative LFM2.5 AI models for on-device applications.
NTv3 revolutionizes genomic prediction and design with its multi-species foundation model.
Explore Google's T5Gemma 2, an advanced encoder-decoder model family emphasizing multimodality and long context for developers.
Lux marks a significant advancement in automated computer use models, achieving top scores on the Online Mind2Web benchmark.
Explore the differences between Transformers and MoE models regarding performance and architecture.
NVIDIA and Mistral AI unveil a partnership enhancing AI efficiency with 10x faster inference on GB200 NVL72 systems.
Discover DeepSeek-V3.2, a model designed to enhance reasoning in long-context workloads with reduced costs.
HtFLlib introduces the first unified benchmarking library for evaluating heterogeneous federated learning methods across multiple data modalities, addressing limitations of traditional FL and enabling robust model collaboration.